Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 412722 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 50.4 MiB |
| Average record size in memory | 128.0 B |
Variable types
| Numeric | 13 |
|---|---|
| DateTime | 1 |
| Categorical | 2 |
markdown1 is highly overall correlated with markdown2 and 3 other fields | High correlation |
markdown2 is highly overall correlated with markdown1 and 3 other fields | High correlation |
markdown3 is highly overall correlated with markdown1 and 3 other fields | High correlation |
markdown4 is highly overall correlated with markdown1 and 3 other fields | High correlation |
markdown5 is highly overall correlated with markdown1 and 3 other fields | High correlation |
size is highly overall correlated with type | High correlation |
store is highly overall correlated with type | High correlation |
type is highly overall correlated with size and 1 other fields | High correlation |
holiday is highly imbalanced (63.4%) | Imbalance |
markdown1 has 265389 (64.3%) zeros | Zeros |
markdown2 has 305692 (74.1%) zeros | Zeros |
markdown3 has 279200 (67.6%) zeros | Zeros |
markdown4 has 281031 (68.1%) zeros | Zeros |
markdown5 has 264638 (64.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-03-10 16:24:33.679614 |
|---|---|
| Analysis finished | 2025-03-10 16:25:04.760516 |
| Duration | 31.08 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
store
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 45 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.295819 |
| Minimum | 1 |
|---|---|
| Maximum | 45 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 11 |
| median | 22 |
| Q3 | 33 |
| 95-th percentile | 43 |
| Maximum | 45 |
| Range | 44 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 12.785237 |
|---|---|
| Coefficient of variation (CV) | 0.57343652 |
| Kurtosis | -1.1490385 |
| Mean | 22.295819 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 0.070698815 |
| Sum | 9201975 |
| Variance | 163.46228 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 10213 | 2.5% |
| 32 | 9973 | 2.4% |
| 11 | 9967 | 2.4% |
| 23 | 9959 | 2.4% |
| 24 | 9949 | 2.4% |
| 6 | 9902 | 2.4% |
| 15 | 9895 | 2.4% |
| 8 | 9886 | 2.4% |
| 40 | 9878 | 2.4% |
| 28 | 9873 | 2.4% |
| Other values (35) | 313227 |
| Value | Count | Frequency (%) |
| 1 | 9830 | |
| 2 | 9622 | |
| 3 | 8904 | |
| 4 | 9569 | |
| 5 | 8997 | |
| 6 | 9902 | |
| 7 | 9758 | |
| 8 | 9886 | |
| 9 | 8834 | |
| 10 | 9606 |
| Value | Count | Frequency (%) |
| 45 | 9629 | |
| 44 | 7169 | |
| 43 | 6671 | |
| 42 | 6885 | |
| 41 | 9832 | |
| 40 | 9878 | |
| 39 | 9532 | |
| 38 | 7356 | |
| 37 | 7192 | |
| 36 | 6222 |
date
Date
| Distinct | 143 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| Minimum | 2010-02-05 00:00:00 |
|---|---|
| Maximum | 2012-10-26 00:00:00 |
temperature
Real number (ℝ)
| Distinct | 3528 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.110237 |
| Minimum | -2.06 |
|---|---|
| Maximum | 100.14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 69 |
| Negative (%) | < 0.1% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -2.06 |
|---|---|
| 5-th percentile | 27.28 |
| Q1 | 46.73 |
| median | 62.11 |
| Q3 | 74.29 |
| 95-th percentile | 87.27 |
| Maximum | 100.14 |
| Range | 102.2 |
| Interquartile range (IQR) | 27.56 |
Descriptive statistics
| Standard deviation | 18.454227 |
|---|---|
| Coefficient of variation (CV) | 0.30700639 |
| Kurtosis | -0.63099911 |
| Mean | 60.110237 |
| Median Absolute Deviation (MAD) | 13.61 |
| Skewness | -0.3238073 |
| Sum | 24808817 |
| Variance | 340.5585 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50.43 | 697 | 0.2% |
| 67.87 | 636 | 0.2% |
| 72.62 | 574 | 0.1% |
| 76.67 | 568 | 0.1% |
| 70.28 | 551 | 0.1% |
| 76.03 | 535 | 0.1% |
| 50.56 | 532 | 0.1% |
| 64.05 | 530 | 0.1% |
| 64.21 | 514 | 0.1% |
| 50.81 | 474 | 0.1% |
| Other values (3518) | 407111 |
| Value | Count | Frequency (%) |
| -2.06 | 69 | |
| 5.54 | 67 | |
| 6.23 | 67 | |
| 7.46 | 69 | |
| 9.51 | 69 | |
| 9.55 | 67 | |
| 10.09 | 66 | |
| 10.11 | 68 | |
| 10.24 | 69 | |
| 10.53 | 72 |
| Value | Count | Frequency (%) |
| 100.14 | 44 | < 0.1% |
| 100.07 | 46 | < 0.1% |
| 99.66 | 48 | < 0.1% |
| 99.22 | 184 | |
| 99.2 | 46 | < 0.1% |
| 98.43 | 43 | < 0.1% |
| 98.15 | 47 | < 0.1% |
| 97.66 | 42 | < 0.1% |
| 97.6 | 48 | < 0.1% |
| 97.18 | 186 |
fuel_price
Real number (ℝ)
| Distinct | 892 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3611193 |
| Minimum | 2.472 |
|---|---|
| Maximum | 4.468 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 2.472 |
|---|---|
| 5-th percentile | 2.653 |
| Q1 | 2.932 |
| median | 3.452 |
| Q3 | 3.738 |
| 95-th percentile | 4.029 |
| Maximum | 4.468 |
| Range | 1.996 |
| Interquartile range (IQR) | 0.806 |
Descriptive statistics
| Standard deviation | 0.45881301 |
|---|---|
| Coefficient of variation (CV) | 0.13650602 |
| Kurtosis | -1.1869028 |
| Mean | 3.3611193 |
| Median Absolute Deviation (MAD) | 0.375 |
| Skewness | -0.10584778 |
| Sum | 1387207.9 |
| Variance | 0.21050938 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3.638 | 2501 | 0.6% |
| 3.63 | 2112 | 0.5% |
| 2.771 | 1884 | 0.5% |
| 3.891 | 1812 | 0.4% |
| 3.594 | 1768 | 0.4% |
| 3.524 | 1764 | 0.4% |
| 3.523 | 1761 | 0.4% |
| 2.72 | 1761 | 0.4% |
| 3.666 | 1745 | 0.4% |
| 2.78 | 1621 | 0.4% |
| Other values (882) | 393993 |
| Value | Count | Frequency (%) |
| 2.472 | 38 | < 0.1% |
| 2.513 | 45 | < 0.1% |
| 2.514 | 886 | |
| 2.52 | 39 | < 0.1% |
| 2.533 | 42 | < 0.1% |
| 2.539 | 37 | < 0.1% |
| 2.54 | 142 | < 0.1% |
| 2.542 | 45 | < 0.1% |
| 2.545 | 38 | < 0.1% |
| 2.548 | 879 |
| Value | Count | Frequency (%) |
| 4.468 | 360 | |
| 4.449 | 352 | |
| 4.308 | 163 | |
| 4.301 | 355 | |
| 4.294 | 358 | |
| 4.293 | 191 | |
| 4.288 | 167 | |
| 4.282 | 165 | |
| 4.277 | 350 | |
| 4.273 | 358 |
markdown1
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2278 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2568.5946 |
| Minimum | 0 |
|---|---|
| Maximum | 88646.76 |
| Zeros | 265389 |
| Zeros (%) | 64.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2779.97 |
| 95-th percentile | 12297.04 |
| Maximum | 88646.76 |
| Range | 88646.76 |
| Interquartile range (IQR) | 2779.97 |
Descriptive statistics
| Standard deviation | 6013.5604 |
|---|---|
| Coefficient of variation (CV) | 2.3411871 |
| Kurtosis | 35.171293 |
| Mean | 2568.5946 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.743194 |
| Sum | 1.0601155 × 109 |
| Variance | 36162909 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 265389 | |
| 460.73 | 102 | < 0.1% |
| 1.5 | 102 | < 0.1% |
| 175.64 | 93 | < 0.1% |
| 6438.2 | 74 | < 0.1% |
| 2732.08 | 73 | < 0.1% |
| 708.33 | 73 | < 0.1% |
| 4611.57 | 73 | < 0.1% |
| 16404.25 | 72 | < 0.1% |
| 11028.34 | 72 | < 0.1% |
| Other values (2268) | 146599 |
| Value | Count | Frequency (%) |
| 0 | 265389 | |
| 0.27 | 51 | < 0.1% |
| 0.5 | 49 | < 0.1% |
| 1.5 | 102 | < 0.1% |
| 1.94 | 50 | < 0.1% |
| 2.12 | 51 | < 0.1% |
| 2.4 | 49 | < 0.1% |
| 2.42 | 49 | < 0.1% |
| 2.43 | 51 | < 0.1% |
| 2.8 | 50 | < 0.1% |
| Value | Count | Frequency (%) |
| 88646.76 | 68 | |
| 78124.5 | 64 | |
| 75149.79 | 66 | |
| 65021.23 | 71 | |
| 62567.6 | 66 | |
| 62172.73 | 69 | |
| 60740.64 | 70 | |
| 60394.73 | 68 | |
| 58928.52 | 63 | |
| 56917.7 | 71 |
markdown2
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1481 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 871.95063 |
| Minimum | 0 |
|---|---|
| Maximum | 104519.54 |
| Zeros | 305692 |
| Zeros (%) | 74.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1.91 |
| 95-th percentile | 3744.31 |
| Maximum | 104519.54 |
| Range | 104519.54 |
| Interquartile range (IQR) | 1.91 |
Descriptive statistics
| Standard deviation | 5045.0042 |
|---|---|
| Coefficient of variation (CV) | 5.7858828 |
| Kurtosis | 145.62963 |
| Mean | 871.95063 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.650619 |
| Sum | 3.5987321 × 108 |
| Variance | 25452067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 305692 | |
| 1.91 | 533 | 0.1% |
| 3 | 492 | 0.1% |
| 0.5 | 482 | 0.1% |
| 1.5 | 466 | 0.1% |
| 4 | 365 | 0.1% |
| 6 | 361 | 0.1% |
| 3.82 | 346 | 0.1% |
| 7.64 | 345 | 0.1% |
| 19 | 339 | 0.1% |
| Other values (1471) | 103301 | 25.0% |
| Value | Count | Frequency (%) |
| 0 | 305692 | |
| 0.02 | 96 | < 0.1% |
| 0.03 | 206 | < 0.1% |
| 0.09 | 137 | < 0.1% |
| 0.11 | 68 | < 0.1% |
| 0.15 | 138 | < 0.1% |
| 0.18 | 205 | < 0.1% |
| 0.24 | 131 | < 0.1% |
| 0.27 | 68 | < 0.1% |
| 0.3 | 135 | < 0.1% |
| Value | Count | Frequency (%) |
| 104519.54 | 67 | |
| 97740.99 | 68 | |
| 92523.94 | 68 | |
| 89121.94 | 70 | |
| 82881.16 | 71 | |
| 72413.71 | 69 | |
| 70574.85 | 66 | |
| 58804.91 | 69 | |
| 58046.41 | 71 | |
| 56106.2 | 70 |
markdown3
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1658 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 451.59872 |
| Minimum | 0 |
|---|---|
| Maximum | 141630.61 |
| Zeros | 279200 |
| Zeros (%) | 67.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 4.33 |
| 95-th percentile | 210.34 |
| Maximum | 141630.61 |
| Range | 141630.61 |
| Interquartile range (IQR) | 4.33 |
Descriptive statistics
| Standard deviation | 5407.7388 |
|---|---|
| Coefficient of variation (CV) | 11.974655 |
| Kurtosis | 256.34454 |
| Mean | 451.59872 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 15.159304 |
| Sum | 1.8638473 × 108 |
| Variance | 29243639 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 279200 | |
| 3 | 740 | 0.2% |
| 6 | 691 | 0.2% |
| 2 | 656 | 0.2% |
| 1 | 607 | 0.1% |
| 0.22 | 473 | 0.1% |
| 0.5 | 463 | 0.1% |
| 4 | 437 | 0.1% |
| 0.01 | 433 | 0.1% |
| 3.2 | 373 | 0.1% |
| Other values (1648) | 128649 |
| Value | Count | Frequency (%) |
| 0 | 279200 | |
| 0.01 | 433 | 0.1% |
| 0.02 | 120 | < 0.1% |
| 0.04 | 238 | 0.1% |
| 0.05 | 69 | < 0.1% |
| 0.06 | 203 | < 0.1% |
| 0.09 | 69 | < 0.1% |
| 0.12 | 68 | < 0.1% |
| 0.13 | 53 | < 0.1% |
| 0.15 | 247 | 0.1% |
| Value | Count | Frequency (%) |
| 141630.61 | 67 | |
| 109030.75 | 69 | |
| 103991.94 | 69 | |
| 101378.79 | 66 | |
| 89402.64 | 65 | |
| 88805.58 | 68 | |
| 83340.33 | 66 | |
| 83192.81 | 69 | |
| 79621.2 | 66 | |
| 77451.26 | 66 |
markdown4
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1945 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1070.9479 |
| Minimum | 0 |
|---|---|
| Maximum | 67474.85 |
| Zeros | 281031 |
| Zeros (%) | 68.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 421.71 |
| 95-th percentile | 5124.86 |
| Maximum | 67474.85 |
| Range | 67474.85 |
| Interquartile range (IQR) | 421.71 |
Descriptive statistics
| Standard deviation | 3861.6797 |
|---|---|
| Coefficient of variation (CV) | 3.6058521 |
| Kurtosis | 87.202343 |
| Mean | 1070.9479 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.1165747 |
| Sum | 4.4200376 × 108 |
| Variance | 14912570 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 281031 | |
| 9 | 274 | 0.1% |
| 4 | 196 | < 0.1% |
| 2 | 188 | < 0.1% |
| 3 | 142 | < 0.1% |
| 47 | 141 | < 0.1% |
| 657.56 | 139 | < 0.1% |
| 1330.36 | 138 | < 0.1% |
| 67.72 | 137 | < 0.1% |
| 17 | 137 | < 0.1% |
| Other values (1935) | 130199 |
| Value | Count | Frequency (%) |
| 0 | 281031 | |
| 0.22 | 57 | < 0.1% |
| 0.41 | 52 | < 0.1% |
| 0.46 | 48 | < 0.1% |
| 0.78 | 52 | < 0.1% |
| 0.87 | 49 | < 0.1% |
| 0.92 | 45 | < 0.1% |
| 1.5 | 55 | < 0.1% |
| 1.88 | 48 | < 0.1% |
| 1.98 | 44 | < 0.1% |
| Value | Count | Frequency (%) |
| 67474.85 | 69 | |
| 57817.56 | 68 | |
| 57815.43 | 68 | |
| 53603.99 | 63 | |
| 52739.02 | 69 | |
| 48403.53 | 70 | |
| 48159.86 | 66 | |
| 48086.64 | 65 | |
| 47452.43 | 71 | |
| 46238.28 | 69 |
markdown5
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2294 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1647.0238 |
| Minimum | 0 |
|---|---|
| Maximum | 108519.28 |
| Zeros | 264638 |
| Zeros (%) | 64.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2143.91 |
| 95-th percentile | 7373.13 |
| Maximum | 108519.28 |
| Range | 108519.28 |
| Interquartile range (IQR) | 2143.91 |
Descriptive statistics
| Standard deviation | 4180.2215 |
|---|---|
| Coefficient of variation (CV) | 2.5380456 |
| Kurtosis | 185.87731 |
| Mean | 1647.0238 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.034612 |
| Sum | 6.7976296 × 108 |
| Variance | 17474252 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 264638 | |
| 2743.18 | 136 | < 0.1% |
| 1064.56 | 120 | < 0.1% |
| 3839.19 | 74 | < 0.1% |
| 2116.1 | 73 | < 0.1% |
| 17316.01 | 73 | < 0.1% |
| 2456.79 | 73 | < 0.1% |
| 28803.28 | 72 | < 0.1% |
| 4169.76 | 72 | < 0.1% |
| 653.02 | 72 | < 0.1% |
| Other values (2284) | 147319 |
| Value | Count | Frequency (%) |
| 0 | 264638 | |
| 135.16 | 64 | < 0.1% |
| 153.04 | 46 | < 0.1% |
| 153.9 | 49 | < 0.1% |
| 164.08 | 52 | < 0.1% |
| 170.64 | 68 | < 0.1% |
| 171.76 | 70 | < 0.1% |
| 180.07 | 64 | < 0.1% |
| 212.75 | 49 | < 0.1% |
| 224.86 | 50 | < 0.1% |
| Value | Count | Frequency (%) |
| 108519.28 | 65 | |
| 105223.11 | 68 | |
| 85851.87 | 66 | |
| 63005.58 | 66 | |
| 58068.14 | 69 | |
| 57029.78 | 68 | |
| 53212.72 | 68 | |
| 37581.27 | 70 | |
| 36430.33 | 68 | |
| 36360.42 | 65 |
cpi
Real number (ℝ)
| Distinct | 2145 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 171.20171 |
| Minimum | 126.064 |
|---|---|
| Maximum | 227.23281 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 126.064 |
|---|---|
| 5-th percentile | 126.49626 |
| Q1 | 132.06443 |
| median | 182.31878 |
| Q3 | 212.51859 |
| 95-th percentile | 221.94916 |
| Maximum | 227.23281 |
| Range | 101.16881 |
| Interquartile range (IQR) | 80.45416 |
Descriptive statistics
| Standard deviation | 39.167528 |
|---|---|
| Coefficient of variation (CV) | 0.22878001 |
| Kurtosis | -1.8298277 |
| Mean | 171.20171 |
| Median Absolute Deviation (MAD) | 41.483671 |
| Skewness | 0.086115801 |
| Sum | 70658711 |
| Variance | 1534.0952 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 130.0710323 | 691 | 0.2% |
| 131.1083333 | 690 | 0.2% |
| 131.043 | 688 | 0.2% |
| 128.9998667 | 688 | 0.2% |
| 131.0756667 | 687 | 0.2% |
| 130.683 | 687 | 0.2% |
| 130.737871 | 686 | 0.2% |
| 131.1453333 | 686 | 0.2% |
| 130.7929 | 686 | 0.2% |
| 129.8459667 | 686 | 0.2% |
| Other values (2135) | 405847 |
| Value | Count | Frequency (%) |
| 126.064 | 665 | |
| 126.0766452 | 665 | |
| 126.0854516 | 660 | |
| 126.0892903 | 670 | |
| 126.1019355 | 672 | |
| 126.1069032 | 669 | |
| 126.1119032 | 668 | |
| 126.114 | 674 | |
| 126.1145806 | 673 | |
| 126.1266 | 673 |
| Value | Count | Frequency (%) |
| 227.2328068 | 63 | |
| 227.214288 | 62 | |
| 227.1693919 | 63 | |
| 227.0369359 | 70 | |
| 227.0184166 | 69 | |
| 226.9873637 | 134 | |
| 226.9735448 | 69 | |
| 226.9688442 | 133 | |
| 226.9662325 | 63 | |
| 226.9239785 | 133 |
unemployment
Real number (ℝ)
| Distinct | 349 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.9655219 |
| Minimum | 3.879 |
|---|---|
| Maximum | 14.313 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 3.879 |
|---|---|
| 5-th percentile | 5.326 |
| Q1 | 6.891 |
| median | 7.866 |
| Q3 | 8.572 |
| 95-th percentile | 12.187 |
| Maximum | 14.313 |
| Range | 10.434 |
| Interquartile range (IQR) | 1.681 |
Descriptive statistics
| Standard deviation | 1.8700957 |
|---|---|
| Coefficient of variation (CV) | 0.23477378 |
| Kurtosis | 2.690108 |
| Mean | 7.9655219 |
| Median Absolute Deviation (MAD) | 0.859 |
| Skewness | 1.1817528 |
| Sum | 3287546.1 |
| Variance | 3.4972578 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8.099 | 5044 | 1.2% |
| 7.852 | 3522 | 0.9% |
| 8.163 | 3520 | 0.9% |
| 7.343 | 3322 | 0.8% |
| 7.931 | 3310 | 0.8% |
| 7.057 | 3304 | 0.8% |
| 6.565 | 3285 | 0.8% |
| 7.441 | 3279 | 0.8% |
| 8.2 | 3273 | 0.8% |
| 6.891 | 3267 | 0.8% |
| Other values (339) | 377596 |
| Value | Count | Frequency (%) |
| 3.879 | 265 | 0.1% |
| 4.077 | 873 | |
| 4.125 | 1816 | |
| 4.145 | 557 | 0.1% |
| 4.156 | 1795 | |
| 4.261 | 1813 | |
| 4.308 | 868 | |
| 4.42 | 1814 | |
| 4.584 | 1969 | |
| 4.607 | 854 |
| Value | Count | Frequency (%) |
| 14.313 | 2605 | |
| 14.18 | 2404 | |
| 14.099 | 2422 | |
| 14.021 | 2242 | |
| 13.975 | 1509 | |
| 13.736 | 2445 | |
| 13.503 | 2637 | |
| 12.89 | 2456 | |
| 12.187 | 2474 | |
| 11.627 | 2478 |
dept
Real number (ℝ)
| Distinct | 81 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.648211 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 18 |
| median | 36 |
| Q3 | 72 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 30.19218 |
|---|---|
| Coefficient of variation (CV) | 0.69171633 |
| Kurtosis | -1.176012 |
| Mean | 43.648211 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.38401241 |
| Sum | 18014577 |
| Variance | 911.56775 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 6435 | 1.6% |
| 13 | 6435 | 1.6% |
| 10 | 6435 | 1.6% |
| 81 | 6435 | 1.6% |
| 21 | 6435 | 1.6% |
| 67 | 6435 | 1.6% |
| 79 | 6435 | 1.6% |
| 46 | 6431 | 1.6% |
| 74 | 6430 | 1.6% |
| 11 | 6426 | 1.6% |
| Other values (71) | 348390 |
| Value | Count | Frequency (%) |
| 1 | 6386 | |
| 2 | 6049 | |
| 3 | 6410 | |
| 4 | 6435 | |
| 5 | 6241 | |
| 6 | 5986 | |
| 7 | 6255 | |
| 8 | 6310 | |
| 9 | 6332 | |
| 10 | 6435 |
| Value | Count | Frequency (%) |
| 99 | 862 | 0.2% |
| 98 | 5836 | |
| 97 | 6278 | |
| 96 | 4854 | |
| 95 | 4483 | |
| 94 | 5631 | |
| 93 | 5880 | |
| 92 | 3973 | |
| 91 | 6200 | |
| 90 | 5432 |
weekly_sales
Real number (ℝ)
| Distinct | 350619 |
|---|---|
| Distinct (%) | 85.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13852.381 |
| Minimum | -4988.94 |
|---|---|
| Maximum | 84112.78 |
| Zeros | 73 |
| Zeros (%) | < 0.1% |
| Negative | 1285 |
| Negative (%) | 0.3% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -4988.94 |
|---|---|
| 5-th percentile | 57.6705 |
| Q1 | 1997.7525 |
| median | 7295.49 |
| Q3 | 18978.175 |
| 95-th percentile | 52321.189 |
| Maximum | 84112.78 |
| Range | 89101.72 |
| Interquartile range (IQR) | 16980.423 |
Descriptive statistics
| Standard deviation | 16884.597 |
|---|---|
| Coefficient of variation (CV) | 1.218895 |
| Kurtosis | 2.9635412 |
| Mean | 13852.381 |
| Median Absolute Deviation (MAD) | 6441.185 |
| Skewness | 1.7980414 |
| Sum | 5.7171823 × 109 |
| Variance | 2.8508963 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 353 | 0.1% |
| 5 | 289 | 0.1% |
| 20 | 232 | 0.1% |
| 15 | 215 | 0.1% |
| 12 | 175 | < 0.1% |
| 1 | 169 | < 0.1% |
| 10.47 | 167 | < 0.1% |
| 11.97 | 154 | < 0.1% |
| 2 | 148 | < 0.1% |
| 7 | 146 | < 0.1% |
| Other values (350609) | 410674 |
| Value | Count | Frequency (%) |
| -4988.94 | 1 | < 0.1% |
| -3924 | 1 | < 0.1% |
| -1750 | 1 | < 0.1% |
| -1699 | 1 | < 0.1% |
| -1321.48 | 1 | < 0.1% |
| -1098 | 3 | |
| -1008.96 | 1 | < 0.1% |
| -898 | 1 | < 0.1% |
| -863 | 1 | < 0.1% |
| -798 | 4 |
| Value | Count | Frequency (%) |
| 84112.78 | 1 | |
| 84110.58 | 1 | |
| 84110.3 | 1 | |
| 84106.09 | 1 | |
| 84102.25 | 1 | |
| 84099.9 | 1 | |
| 84097.87 | 1 | |
| 84097.64 | 1 | |
| 84092.89 | 1 | |
| 84090.01 | 1 |
holiday
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| 0 | |
|---|---|
| 1 | 28918 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 412722 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 383804 | |
| 1 | 28918 | 7.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 383804 | |
| 1 | 28918 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 383804 | |
| 1 | 28918 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 412722 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 383804 | |
| 1 | 28918 | 7.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 412722 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 383804 | |
| 1 | 28918 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 412722 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 383804 | |
| 1 | 28918 | 7.0% |
type
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| 0 | |
|---|---|
| 1 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 412722 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 208029 | |
| 1 | 162264 | |
| 2 | 42429 | 10.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 208029 | |
| 1 | 162264 | |
| 2 | 42429 | 10.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 208029 | |
| 1 | 162264 | |
| 2 | 42429 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 412722 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 208029 | |
| 1 | 162264 | |
| 2 | 42429 | 10.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 412722 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 208029 | |
| 1 | 162264 | |
| 2 | 42429 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 412722 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 208029 | |
| 1 | 162264 | |
| 2 | 42429 | 10.3% |
size
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 40 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 135728.18 |
| Minimum | 34875 |
|---|---|
| Maximum | 219622 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 34875 |
|---|---|
| 5-th percentile | 39690 |
| Q1 | 93638 |
| median | 128107 |
| Q3 | 202505 |
| 95-th percentile | 206302 |
| Maximum | 219622 |
| Range | 184747 |
| Interquartile range (IQR) | 108867 |
Descriptive statistics
| Standard deviation | 60954.044 |
|---|---|
| Coefficient of variation (CV) | 0.44908907 |
| Kurtosis | -1.2164887 |
| Mean | 135728.18 |
| Median Absolute Deviation (MAD) | 70910 |
| Skewness | -0.30318316 |
| Sum | 5.6018008 × 1010 |
| Variance | 3.7153955 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 39690 | 20728 | 5.0% |
| 39910 | 20583 | 5.0% |
| 203819 | 19741 | 4.8% |
| 158114 | 10213 | 2.5% |
| 203007 | 9973 | 2.4% |
| 207499 | 9967 | 2.4% |
| 114533 | 9959 | 2.4% |
| 202505 | 9902 | 2.4% |
| 123737 | 9895 | 2.4% |
| 155078 | 9886 | 2.4% |
| Other values (30) | 281875 |
| Value | Count | Frequency (%) |
| 34875 | 8997 | |
| 37392 | 8904 | |
| 39690 | 20728 | |
| 39910 | 20583 | |
| 41062 | 6671 | 1.6% |
| 42988 | 7156 | 1.7% |
| 57197 | 9435 | |
| 70713 | 9758 | |
| 93188 | 9812 | |
| 93638 | 9450 |
| Value | Count | Frequency (%) |
| 219622 | 9813 | |
| 207499 | 9967 | |
| 206302 | 9873 | |
| 205863 | 9569 | |
| 204184 | 9710 | |
| 203819 | 19741 | |
| 203750 | 9704 | |
| 203742 | 9393 | |
| 203007 | 9973 | |
| 202505 | 9902 |
| cpi | dept | fuel_price | holiday | markdown1 | markdown2 | markdown3 | markdown4 | markdown5 | size | store | temperature | type | unemployment | weekly_sales | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cpi | 1.000 | -0.013 | -0.042 | 0.012 | 0.185 | 0.150 | 0.168 | 0.176 | 0.195 | -0.000 | -0.236 | 0.173 | 0.185 | -0.388 | -0.022 |
| dept | -0.013 | 1.000 | 0.003 | 0.000 | 0.003 | 0.002 | 0.003 | 0.001 | 0.003 | -0.006 | 0.017 | 0.001 | 0.082 | 0.008 | -0.049 |
| fuel_price | -0.042 | 0.003 | 1.000 | 0.137 | 0.469 | 0.296 | 0.403 | 0.428 | 0.444 | 0.004 | 0.074 | 0.127 | 0.089 | -0.058 | 0.003 |
| holiday | 0.012 | 0.000 | 0.137 | 1.000 | 0.039 | 0.220 | 0.281 | 0.072 | 0.033 | 0.000 | 0.000 | 0.187 | 0.000 | 0.035 | 0.009 |
| markdown1 | 0.185 | 0.003 | 0.469 | 0.039 | 1.000 | 0.796 | 0.903 | 0.953 | 0.968 | 0.075 | -0.034 | -0.019 | 0.093 | -0.227 | 0.026 |
| markdown2 | 0.150 | 0.002 | 0.296 | 0.220 | 0.796 | 1.000 | 0.725 | 0.789 | 0.787 | 0.102 | -0.056 | -0.129 | 0.045 | -0.183 | 0.032 |
| markdown3 | 0.168 | 0.003 | 0.403 | 0.281 | 0.903 | 0.725 | 1.000 | 0.869 | 0.905 | 0.086 | -0.029 | -0.073 | 0.037 | -0.206 | 0.034 |
| markdown4 | 0.176 | 0.001 | 0.428 | 0.072 | 0.953 | 0.789 | 0.869 | 1.000 | 0.924 | 0.137 | -0.095 | -0.034 | 0.053 | -0.219 | 0.053 |
| markdown5 | 0.195 | 0.003 | 0.444 | 0.033 | 0.968 | 0.787 | 0.905 | 0.924 | 1.000 | 0.082 | -0.021 | -0.028 | 0.057 | -0.241 | 0.026 |
| size | -0.000 | -0.006 | 0.004 | 0.000 | 0.075 | 0.102 | 0.086 | 0.137 | 0.082 | 1.000 | -0.157 | -0.043 | 0.851 | -0.064 | 0.275 |
| store | -0.236 | 0.017 | 0.074 | 0.000 | -0.034 | -0.056 | -0.029 | -0.095 | -0.021 | -0.157 | 1.000 | -0.056 | 0.540 | 0.296 | -0.094 |
| temperature | 0.173 | 0.001 | 0.127 | 0.187 | -0.019 | -0.129 | -0.073 | -0.034 | -0.028 | -0.043 | -0.056 | 1.000 | 0.124 | 0.030 | -0.018 |
| type | 0.185 | 0.082 | 0.089 | 0.000 | 0.093 | 0.045 | 0.037 | 0.053 | 0.057 | 0.851 | 0.540 | 0.124 | 1.000 | 0.179 | 0.156 |
| unemployment | -0.388 | 0.008 | -0.058 | 0.035 | -0.227 | -0.183 | -0.206 | -0.219 | -0.241 | -0.064 | 0.296 | 0.030 | 0.179 | 1.000 | -0.014 |
| weekly_sales | -0.022 | -0.049 | 0.003 | 0.009 | 0.026 | 0.032 | 0.034 | 0.053 | 0.026 | 0.275 | -0.094 | -0.018 | 0.156 | -0.014 | 1.000 |
| store | date | temperature | fuel_price | markdown1 | markdown2 | markdown3 | markdown4 | markdown5 | cpi | unemployment | dept | weekly_sales | holiday | type | size | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2010-02-05 | 42.31 | 2.572 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 211.096358 | 8.106 | 1 | 24924.50 | 0 | 0 | 151315 |
| 1 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 3 | 14612.19 | 0 | 1 | 103681 |
| 2 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 4 | 26323.15 | 0 | 1 | 103681 |
| 3 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 5 | 36414.63 | 0 | 1 | 103681 |
| 4 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 6 | 11437.81 | 0 | 1 | 103681 |
| 5 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 7 | 23416.24 | 0 | 1 | 103681 |
| 6 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 8 | 27545.38 | 0 | 1 | 103681 |
| 7 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 9 | 12454.61 | 0 | 1 | 103681 |
| 8 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 10 | 15052.46 | 0 | 1 | 103681 |
| 9 | 35 | 2010-02-05 | 27.19 | 2.784 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 135.352461 | 9.262 | 2 | 57523.15 | 0 | 1 | 103681 |
| store | date | temperature | fuel_price | markdown1 | markdown2 | markdown3 | markdown4 | markdown5 | cpi | unemployment | dept | weekly_sales | holiday | type | size | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 412712 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 41 | 2820.28 | 0 | 0 | 219622 |
| 412713 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 42 | 7707.68 | 0 | 0 | 219622 |
| 412714 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 44 | 12780.02 | 0 | 0 | 219622 |
| 412715 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 46 | 38219.89 | 0 | 0 | 219622 |
| 412716 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 48 | 1241.00 | 0 | 0 | 219622 |
| 412717 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 49 | 7770.71 | 0 | 0 | 219622 |
| 412718 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 50 | 1486.00 | 0 | 0 | 219622 |
| 412719 | 13 | 2012-10-26 | 46.97 | 3.755 | 10192.49 | 364.57 | 150.0 | 1714.15 | 5563.92 | 131.193097 | 5.621 | 52 | 4738.93 | 0 | 0 | 219622 |
| 412720 | 41 | 2012-10-26 | 41.80 | 3.686 | 4864.30 | 101.34 | 250.6 | 47.24 | 1524.43 | 199.219532 | 6.195 | 4 | 32699.78 | 0 | 0 | 196321 |
| 412721 | 45 | 2012-10-26 | 58.85 | 3.882 | 4018.91 | 58.08 | 100.0 | 211.94 | 858.33 | 192.308899 | 8.667 | 98 | 1076.80 | 0 | 1 | 118221 |